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In this report we study the problem of minimising de- 
terministic automata over finite and infinite words. 
Deterministic finite automata are the simplest de- 
vices to recognise regular languages, and determin- 
istic Biichi, Co-Biichi, and parity automata play a 
similar role in the recognition of cu-regular languages. 
While it is well known that the minimisation of deter- 
ministic finite and weak automata is cheap, the com- 
plexity of minimising deterministic Biichi and parity 
automata has remained an open challenge. We es- 
tablish the NP-completeness of these problems. 

A second contribution of this report is the intro- 
duction of almost equivalence, an equivalence class 
for strictly between language equivalence for deter- 
ministic Biichi or Co-Biichi automata and language 
equivalence for deterministic finite automata. Two 
finite automata are almost equivalent if they, when 
used as a monitor, provide a different answer only a 
bounded number of times in any run, and we call the 
minimal such automaton relatively minimal. Minimi- 
sation of DFAs, hyper-minimisation, relative minimi- 
sation, and the minimisation of deterministic Biichi 
(or Co-Biichi) automata are operations of increasing 
reduction power, as the respective equivalence rela- 
tions on automata become coarser from left to right. 
Besides being a natural equivalence relation for finite 
' automata, almost equivalence is language preserving 
for weak automata, and can therefore also be viewed 
as a generalisation of language equivalence for weak 
automata to a more general class of automata. From 
the perspective of Biichi and Co-Biichi automata, we 
gain a cheap algorithm for state-space reduction that 
also turns out to be beneficial for further heuristic or 
exhaustive state-space reductions put on top of it. 



*This work was partly supported by the Engineering and 
Physical Science Research Council (EPSRC) through the grant 
EP/H046623/1 'Synthesis and Verification in Markov Game 
Structures'; it is an extended version of [SchlOj . 



The minimisation of deterministic finite automata 
(DFAs) is a classic problem with an efficient solution 
|Hop70 IHMUOO] . This report was originally writ- 
ten with only the question in mind of whether or 
not a similar result can be obtained for deterministic 
automata over infinite words. Is their minimisation 
tractable? For weak automata, the answer is known 
to be positive [LOl] , which seems to encourage a quest 
for a tractable solution for Biichi, Co-Biichi, and par- 
ity automata as well. However, it turns out that their 
minimisation is intractable (NP-complete) . 

This raised the question whether there are 
natural tractable problems between the minimi- 
sation of DFAs and deterministic Biichi au- 
tomata (DBAs) or deterministic Co-Biichi automata 
(DCAs). The hyper-minimisation of deterministic 
automata |BC!Sn9[ IBadOQi IGJCM IHMlOj is such an 
example: If we minimise a DFA while allowing for 
a finite symmetrical difference between the language 
of the source and target automaton, we might be re- 
warded by a smaller automaton. 

We introduce a second relaxation, almost equiva- 
lence, where we require that acceptance differs only 
on finitely many prefixes of every infinite word. This 
provides the guarantee that, on each infinite run, the 
result is equivalent in almost all positions (cf. Fig- 
ure [T]), which is not only interesting in itself, but can 
also be viewed as a generalisation of the minimisation 
problem of weak automata [LC)I] to a more general 
class. 

This is a natural notion of almost equivalence 
on DFAs, which also forms a promising basis for 
state-space reduction of Biichi and Co-Biichi au- 
tomata. Different to the NP-completeness of min- 
imising Biichi and Co-Biichi automata, we show that 
finding a minimal almost equivalent DFA is cheap. 
It is also a useful starting point for a state-space re- 







Figure 1: Figure[T^ shows a minimal DFA A over the two letter alphabet {a, b}. Figure[T)3 shows a hypermin- 
isation of A, Figure [TJ; shows a minimal almost equivalent automaton to A, and Figure [T]l shows a minimal 
language equivalent DBA to A. (Neither hyperminimal nor relative minimal automata need to be unique.) 
Hyperminimal automata are the minimal automata with a finite symmetric language difference to the source 
automaton [BG S091 IBad091 IG J09j : they may only differ from the minimal automaton in the preamble. (The 
trivial SCCs of the automaton reachable from the initial state.) Minimal almost equivalent automata only 
guarantee that the symmetrical difference intersected with the prefixes of every infinite word are finite. (In 
both cases, finite implies bounded by the number of states of the automaton.) For weak automata — automata 
whose language is equivalent when read as DBA or DCA — a minimal almost equivalent automaton is also a 
minimal weak automaton. 



duction of a DBA or DCA A, because minimisation 
with respect to almost equivalence (like minimisation 
and hyper-minimisation) of A when read as a DFA 
are language preserving. 

The algorithm we develop for finding a minimal 
almost equivalent DFA allows for more: It can be 
strengthened by using language equivalence on A 
(when read as a Biichi or Co-Biichi automata) in 
the algorithm, which provides for a smaller — yet still 
language equivalent — target automaton, and this au- 
tomaton comes with the interesting property that 
one can focus on its strongly connected components 
(SCCs) in isolation when trying to reduce its state- 
space further. 

While the NP-completeness of the minimisation 
problem of DBAs, DCAs, and deterministic parity 
automata (DPAs) seems to rule out the use of state- 
space reduction on large scale problems, this reduc- 
tion technique therefore suggests that one might of- 
ten get far on the way of reducing the state-space 
without having to pay a high price, while getting for 
free a division of the remaining potential parts of the 
automaton for further reduction. 

This is fortunate, because the standard verification 
technique for the verification of Markov decision pro- 



cesses against LTL specifications |BK08j as well as 
the synthesis of distributed systems from LTL speci- 
fications tR ab72[ lPR89al IPR89b[ ISFMI [PitOTl ISchOO] 
require working with these deterministic w-automata, 
and techniques for the minimisation, or, indeed, 
for the state-space reduction of the automata in- 
volved are more than welcome. The argument in 
favour of such reductions becomes even stronger 
for algorithms that synthesise distributed systems 
|PR90l IKVOll IMTOll IWM031 IF505] , where determin- 
istic automata occur in various steps of the construc- 
tion. 

Organisation of the Report. In the following 
section, we introduce the basic notions of determinis- 
tic automata over finite and infinite words. In Section 
[3] we establish the NP-completeness of the minimisa- 
tion problems for deterministic Biichi, Co-Biichi, and 
parity automata. In Section H] we introduce the prob- 
lem of relative minimisation of deterministic finite 
automata, show that it is more powerful than hyper- 
minimisation and can be used to reduce the state- 
space of a deterministic Biichi or Co-Biichi automa- 
ton, and develop an algorithm for the relative min- 
imisation of deterministic finite automata. We then 



show that this algorithm can be strengthened further 
when used for reducing the state-space of Biichi and 
Co-Biichi automata, and that the structure of the re- 
sulting automaton is beneficial for further state-space 
reductions. 

2 Deterministic Automata 

w- Automata. Parity automata are word automata 
that recognise the w-regular languages over finite set 
of symbols. A deterministic parity automaton is a 
tuple V = (S, Q, qq, (5, tt), where 

• S denotes a finite set of symbols, 

• Q denotes a finite set of states, 

• qo & Q+ with (5+ = QLJ{_L,T} denotes a desig- 
nated initial state, 

• 6 : Q-(_ X S ^ (5-(_ is a function that maps pairs 
of states and input letters to either a new state, 
or to _L (false, immediate rejection, blocking) 
or T (true, immediate acceptance]^, such that 
5{T, a) = T and 5(_L, a) = ± hold for ah a G S, 
and 

• IT : Q+ — >■ P C N is a priority function that maps 
states to natural numbers (mapping _L and T to 
an odd and even number, respectively), called 
their priority. (They are often referred to as 
colours.) 

Parity automata read infinite input words a = 
aoaia2 ■ ■ ■ € S'^. (As usual, w = No denotes the 
non- negative integers.) Their acceptance mecha- 
nism is defined in terms of runs: The unique run 
p = rorir2 ... S Q+'^ of V on a is the tj-word 
that satisfies tq = qo and, for all z G w, rj+i = 
5{ri,ai). A run is called accepting if the highest num- 
ber occurring infinitely often in the infinite sequence 
7r(ro)7r(ri)7r(r2) ... is even, and rejecting if it is odd. 
An w-word is accepted by V if its run is accepting. 
The set of w-words accepted by V is called its lan- 
guage, denoted C{V). 



^The question whether or not an automaton can immedi- 
ately accept or reject is a matter of taste. Often, immediate 
rejection is covered by allowing 5 to be partial while there is 
no immediate acceptance. For technical convenience, we allow 
both, but treat T and ± as accepting and rejecting sink states, 
respectively. 



We assume without loss of generality that max P < 
IQI + 1. (If a priority p ^ 2 does not exist, we can re- 
duce the priority of all states whose priority is strictly 
greater than p by 2 without affecting acceptance.) 

Deterministic Biichi and Co-Biichi automata — 
abbreviated DBAs and DCAs — are DPAs where the 
image of the priority function n is contained in {1,2} 
and {2,3}, respectively. In both cases, the automa- 
ton is often denoted A = (Ti,Q,qQ,6, F), where 
F Q Q+ denotes those states with priority 2. The 
states in F are also called final or accepting states, 
while the remaining states Q-i- \ F are called rejecting 
states. 

Finite Automata. Finite automata are word au- 
tomata that recognise the regular languages over fi- 
nite set of symbols. A deterministic finite automaton 
(DFA) is a tuple F = (S, Q, qo, 5, F), where S, Q, qQ, 
and 6 are defined a for DPAs, and F C Qu{T} is a 
set of final states that contains T (but not _L). 

Finite automata read finite input words a = 
000102 ... On £ 5]*. Their acceptance mechanism 
is again defined in terms of runs: The unique run 
P = ^o^i^2 • • • i"n+i G Q+^ of J^ on a is the word that 
satisfies r^ = qo and, for all i <n, rj+i = 5{ri, Oj). A 
run is called accepting if it ends in a final state (and 
rejecting otherwise), a word is accepted by F if its 
run is accepting, and the set of words accepted by F 
is called its language, denoted C{F). 

Automata Transformations & Conventions. 

For a deterministic automaton A = {Ti,Q,qQ,5,F) 
or A= {T,,Q,qQ,5,-K) and a state q G Q+, we denote 
with Aq = {T,,Q,q,6,F) or Aq = {T,,Q,q,6,TT), re- 
spectively, the automaton resulting from A by chang- 
ing the initial state to q. We also read finite automata 
at times as Biichi (or Co-Biichi) automata and Biichi 
(or Co-Biichi) automata as finite automata in the 
constructions, and let DFAs run on infinite words 
where this is convenient and its meaning is clear in 
the context. 

Automata define a directed graph whose unravel- 
ling from the initial state defines the possible runs. 
For an automaton A = (S, Q,gO)'^)-^) or A = 
(S, Q, qo, 6, it), this is the directed graph {Q^,T) with 
T = {{p, g) G Q+ X Q+ I 3(j G S. 6{p, a) = q}. When 
referring to the reachable states (which always means 
reachable from the initial state) and SCCs of an au- 
tomaton, this refers to this graph. 



Emptiness and Equivalence. A DPA is called 
empty if its language is empty and universal if it 
accepts every word a € S'^. For two automata 
V^ = {J:,Qi,ql6i,TTi) and P^ = (S, Q2,go' -^z, vra), 
two states qi £ Qi and 52 £ Q2 are called equivalent 
if C{Vq^) = C{Vq^). (Equivalence of states naturally 
extends to the same automaton, as V^ and V"^ are not 
necessarily different.) Two automata are equivalent 
if their initial states are equivalent. (Or, likewise, if 
they recognise the same language.) 

Emptiness, universality, and equivalence of parity, 
Blichi, and Co-Biichi automata is computationally 
easy: 

Theorem 2.1 Language non-inclusion of two par- 
ity automata V^ = (S, Qi,gg, (5i,7ri) and V'^ = 
{T,,Q2,qQ, 62,112) can be checked in non- deterministic 
logarithmic space. 

Proof: We describe how to check non-emptiness of 
C{V^) \ £{V^). {C{V^) \ C{V^) can be checked ac- 
cordingly, and a non-deterministic machine can guess 
which to check.) 

C{V^) \ C{V'^) is non-empty if there is a word a = 
ooai«2 . . . € S"^ such that the run rQrJr2 ... of "P^ on 
a is accepting, while the run and rgr^r| ... of V^ on 
a is rejecting. 

A necessary condition for this is that there are po- 



^j+i> rf 



'i+i' 



and 



sitions i < j such that rj 

the highest priority in rjrj_^_i . . . r^ is even, while the 
highest priority in rfrf_^_^ . . . r? is odd. However, the 
existence of two runs with this property is also a suffi- 
cient condition for C{V^) ^ C{V'^), because the word 
a' = ooai . . . a.j_i(ajai+i . . . aj)'^ is accepted by V^ 
and rejected by V^. 

Consequently, we can use a non-deterministic ma- 
chine that guesses a' on the fly, and guesses when i 
and j are reached. All that this machine needs to 
store is 

• the current state of V^ and "P^, 

• the current guessed input letter, 

• once i is reached (guessed): rj and rf, and 

• the highest priority seen since it guessed being 
in position i for both runs. 

(Initialising the values to vr(r?^) and '7r(r?), and 
computing and storing the maximum of the cur- 
rent value and the priority of the respective cur- 
rent state.) 



Upon reaching j (guessed), the machine checks if the 
highest priority stored is even for the run of Vi, and 
odd for V2- 

The overall memory required is logarithmic in the 
size of the automaton. D 

Reachability in a directed graph can obviously be 
reduced in deterministic logspace to checking lan- 
guage non-emptiness of a Biichi or Co-Biichi automa- 
ton with only rejecting states, or non-universality of 
a Blichi or Co-Biichi automaton with only accepting 
states, respectively. Likewise, testing universality or 
emptiness can be reduced in deterministic logspace 
to checking language inclusion (in the respective di- 
rection) with a trivial Biichi or Co-Biichi automaton 
that immediately changes to T or _L, respectively, for 
every input letter. With the fact that NL is closed un- 
der complementation |Imm88| . this immediately im- 
plies: 

Corollary 2.2 Language inclusion, equivalence, 
emptiness, and universality of parity, Biichi, and 
Co-Biichi automata and their co-problems are 
NL-complete. D 



3 Minimising Biichi and Parity 
Automata is NP-Complete 

In this section we show that the minimisation of de- 
terministic Blichi, Co-Biichi, and parity automata are 
NP-complete problems. This is in contrast to the 
tractabl e min imisation of finite |Hop70| and weak au- 
tomata [LOl]. 

The hardest part of the NP-completeness proof is 
a reduction from the problem of finding a minimal 
vertex cover of a graph to the minimisation of deter- 
ministic Biichi automata. For this reduction, we first 
define the characteristic language of a simple con- 
nected graph. For technical convenience we assume 
that this graph has a distinguished initial vertex. 

We show that the states of a deterministic Biichi au- 
tomaton that recognises this characteristic language 
must satisfy side-constraints, which imply that it has 
at least 2n -\- k states, where n is the number of ver- 
tices of the graph, and k is the size of its minimal ver- 
tex cover. We then show that, given a vertex cover of 
size k, it is simple to construct a deterministic Biichi 
automaton of size 2n-\- k that recognises the charac- 
teristic language of this graph. (It can be constructed 
in linear time and logarithmic space.) Furthermore, 



we show that minimising the automaton defined by 
the trivial vertex cover can be used to determine a 
minimal vertex cover for this graph, which concludes 
the reduction. 

We call a non-trivial (|y| > 1) simple connected 
graph ^^Q = {V, E) with a distinguished initial vertex 
vq ^ V nice. As a warm-up, we have to show that 
the restriction to nice graphs leaves the problem of 
finding a minimal vertex cover NP-complete. 

Lemma 3.1 The problem of checking whether a nice 
graph Q^^ has a vertex cover of size k is NP-complete. 

Proof: As a special case of the vertex cover prob- 
lem, it is in NP, and the problem of finding a ver- 
tex cover of size k for a graph {V, E) can be re- 
duced to the problem of checking if the nice graph 
g^ = {V(j{v,v'},E VJ {{w,v} I w G FU{f'}} has 
a vertex cover of size k + 1: A vertex cover of Q^ 
must contain a vertex cover of {V^E) and v or v\ 
and a vertex cover of (V, E) plus f is a vertex cover 
of G^. U 

We define the characteristic language C{Qyg) of a 
nice graph Q^^ as the cu-language over Ft, = Fujll} 
(where t] indicates a stop of the evaluation in the next 
step — it can be read 'stop') consisting of 

1. all w-words of the form t;o*f i^V2^t'3^f4^ . . . G 
V^ with {vi-i,Vi} G E for all i £ N, (words 
where vq, f i, f2, . . . form an infinite path in Gvq), 
and 

2. all w-words starting with Wo*f i''~W2''^ • • • ^n'^'tj^n £ 
Vt|* with n G No and {fi-i,Wj} G E for all i G 
N. (Words where vo,vi,V2, ■ ■ ■ ,Vn form a finite — 
and potentially trivial — path in Q^^, followed by 
a \\ sign, followed by the last vertex of the path 

Vo,Vi,V2,...,Vn-) 

We call the w-words in (1) trace-words, and those in 
(2) \\-words. The trace- words are in V^ , while the 
l^-words are in y^'^ \ V^. 

Let ;B be a deterministic Biichi automaton that 
recognises the characteristic language of Qv^ = 
{V, E). We call a state of B 

• a v-state if it can be reached upon an input 
'„+ G ^^*, with n G No and 



• a v\\-state if it can be reached from a f-state upon 
reading a \\ sign. 

We call the union over all f-states the set of vertex- 
states, and the union over all w [^-states the set of In- 
states. 

Lemma 3.2 Let Qv^^ = {y,E) he a nice graph with 
initial vertex vq, and let B = {V, Q, qo, 5, F) be a de- 
terministic Biichi automaton that recognises the char- 
acteristic language of Qy^ . Then (1) the vertex- and 
\^-states of B are disjoint, and, for all v,w ^V with 
v ^ w, (2) the v-states and w-states and (3) the wt]- 
and w\\-states are disjoint. For each vertex v £ V , 
there is (4) a v\\-state and (5) a rejecting v-state, and 
(6), for every edge {v, w} G E, there is an accepting 
v-state or an accepting w-state. 

Proof: 



1. Let gS, be a ftj-state and q a vertex-state. As B 
recognises C{Q 
must reject it. 



recognises C{Qvq), B \i must accept v'^ , while Bq 



+,,„+ 



word vq*vi'^V2 

{vi^i,Vi\ G E for all i G N, that ends va v = Vn 

(in particular, the initial state of /B is a wo-state), 

and 



2. Let qv be a f -state and let q^ be a ti;-state with 

V ^ w. As B recognises C{Qv(,), Bq^ must accept 
tj^"^, while Bq^ must reject it. 

3. Let gS be a vtl-state and let (^ be a w\\-state with 

V ^ w. As B recognises C{Qvr,), B \i must accept 
v^, while B ti must reject it. 

Qw 

4. As Qvo is connected, there is, for every v £ V, 
a path wot'it'2 • • • w in Qy^, and the state reached 
by B upon reading f 1^2 ... wt] is a wtj-state. 

5. As Qvo is connected, there is, for every v £ V, a 
path f 01^1^2 ... w in 0^^. After reading fif2 . . .v, 
is in a u-state. B remains in f-states if it hence- 
forth reads f 's. (Note that the automaton can- 
not block/reject immediately, as it should accept 
a continuation \\v^ at any time.) As the word is 
rejecting, almost all states in the run of the au- 
tomaton are rejecting f-states. 

6. Let us consider an arbitrary edge {v,w}. As Q^^ 
is connected, there is a path from wo^i^2 • • • f in 
Qyg, and ^1^2 . . . v{wv)^ is in C{Qvf^); the run of S 
on this w-word is therefore accepting. As almost 
all states in this accepting run are f-states or te- 
states, there must be an accepting v-state or an 
accepting tu-state. D 



The sixth claim imphes that the set C of vertices 
with an accepting vertex-state is a vertex cover of 
SvQ = (V, E). It is also clear that B has at least \V\ re- 
jecting vertex-states, \C\ accepting vertex-states, and 
\V\ li-states: 

Corollary 3.3 For a deterministic Biichi automa- 
ton that recognises the characteristic language of a 
nice graph Q^q = iV^E) with initial vertex vq, the 
set C = {v ^ V \ there is an accepting v-state} is 
a vertex cover of Q^q, and B has at least 2\V\ + \C\ 
states. D 

It is not hard to define, for a given nice graph 
Gvo = (^1 E) with vertex cover C, a Biichi automa- 
ton Bp = {V^, {V X {r, ^})U(C x {a}), (z;o, r), 5, (C x 
{a})LJ{T}) with 2\V\ + \C\ states that recognises the 
characteristic language of ^^^ : We simply choose 

• 5{{v,r),v') = {v',a) if {v,v'} G E and v' S C, 
6{{v,r),v') = iv',r) if {v,v'} € E and v' ^ C, 
6(^{v, r), f') = (v, r) a V = v', 

6{{v,r),v') = {v,\]) ifv' = t], and 
6[{v,r),v'^ = _L otherwise; 

• 6[{v,a),v') = 6[{v,r),v'^, and 

• 6{{v, \\),v) = T and 6{{v, \\),v') = _L for v' ^ v. 

Bq° simply has one utl-state for each vertex v ^V 
of ^t,Q, one accepting v-state for each vertex in the 
vertex cover C, and one rejecting t;-vertex for each 
vertex f G F of Q^,^ . It moves to the accepting copy 
of a vertex state v only upon taking an edge to v, but 
not on a repetition of v. 

Lemma 3.4 For a nice graph Q^^ = (V, E) with ini- 
tial vertex vq and vertex cover C , Bq^^ recognises the 
characteristic language of Gyg . 

Proof: To show C{B(J°) C £(^^q), let us consider 

an cj-word a accepted by B^J^'' . Then it is either 
eventually accepted immediately when reading a v 
from a state {v,[\), or by seeing accepting states in 
C X {a} infinitely many times. By the construction 
of Bq*^ , a must be a ?;^-word in the first case, and a 
trace- word in the latter. 

To show C{B(J°) 5 £(^t,(,), it is apparent that 
[^-words are accepted immediately after reading the 
initial sequence that makes them tl-words, while a 



trace- word fo*" ^vi^V2'^v^^^ 



e V^ with ij G N 



and {vj,Vjj^i\ G E for all j G w, has the run p = 
{vo,ry''{vi,pi){vi,ry^-'^{v2,P2){v2,ry^~'^{v:i,P'i) . . ., 
with Pi = a (and hence {vi,pi) accepting) if Vi in 
C. As C is a vertex cover, this is at least the case 
for every second index. (There is no n G N with 
{vn,Vn+i} n C = 0.) p therefore contains infinitely 
many accepting states. D 

Corollarv 13.31 and Lemma 13.41 immediatelv imply: 



Corollary 3.5 Let C be a minimal vertex cover of 



a nice graph Q^ 



{V,E). Then B\ 
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the characteristic language of Q. 
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From here, it is a small step to the main theorem 
of this section: 

Theorem 3.6 The problem, of whether there is, for 
a given deterministic Biichi automaton, a language 
equivalent Biichi automaton with at most n states is 
NP-complete. 

Proof: For containment in NP, we can simply 
use non-determinism to guess such an automaton. 
Checking that it is language equivalent is then in NL 
by Corollary [221 

By Corollary 13. 5| we can reduce checking if a nice 
graph Gy with m vertices has a vertex cover of size 
k to checking if the deterministic Biichi automaton 
By" — which has 3m states and is easy to construct 
(in deterministic logspace) — has a language equiva- 
lent Biichi automaton with 2m + k states. As the 
problem we reduced from is NP-complete by Lemma 
13. 1| this concludes the reduction. D 

As minimising Co-Biichi automata coincides with 
minimising the dual Biichi automata, the similar 
claim holds for Co-Biichi automata. 

Corollary 3.7 The problem of whether there is, for 
a given deterministic Co-Biichi automaton C, a lan- 
guage equivalent Co-Biichi automaton with at most n 
states is NP-complete. D 

The problem of minimising deterministic parity au- 
tomata cannot be easier than the problem of minimis- 
ing Biichi automata, and the 'in NP' argument that 
we can simply guess a language equivalent DPA and 
then inexpensively check correctness (by Corollary 
12. 2p extends to parity automata. 



Corollary 3.8 The problem of whether there is, for 
a given parity automaton, a language equivalent par- 
ity automaton with n states is NP-complete. D 

Note that, while there is a minimal number of prior- 
ities required for every language, the number of states 
cannot be reduced by increasing the number of prior- 
ities, and minimising the number of priorities can be 
done in polynomial time, changing only the priority 
functions [NW981 [CM99] . 

4 Relative DFA Minimisation 

Minimisation techniques for deterministic finite 
automata can be used to minimise determin- 
istic Biichi and Co-Biichi automata. They 
are cheap — Hopcroft's algorithm works in time 
O(nlogn) [Hop70| — and have proven to be power- 
ful devices for state-space reduction. From a prac- 
tical point of view, this invites — in the light of 
the intractability result for minimising deterministic 
Biichi and Co-Biichi automata — the question if such 
tractable minimisation techniques can be used for a 
space reduction of Biichi and Co-Biichi automata. 
From a theoretical point of view, this invites the 
question of whether there are interesting tractable 
minimisation problems between the minimisation (or 
hyper-minimisation |B(JSn9l IBadOQl [GJfM IHMinQ 
of finite automata, and the minimisation of Biichi 
and Co-Biichi automata. 

Both the theoretical and the practical question turn 
out to have a positive answer: An answer to the the- 
oretical question is that we can define almost equiv- 
alence on automata and their states as a relation, 
where two automata or states are almost equivalent 
if their language intersected with the initial sequences 
of every omega word have finite difference. We show 
that a minimal almost equivalent automaton is easy 
to construct. Besides being interesting on their own 
account (for example, if we want to construct a mon- 
itor that errs only a bounded number of times for 
every input word), they are language preserving for 
deterministic Biichi and Co-Biichi automata. What 
is more, a minimal almost equivalent automata to a 
weak automaton (an automaton that recognises the 
same language as DBA and DCA) is a minimal lan- 
guage equivalent weak automaton. 

From a practical point of view, the algorithm sug- 
gests an approximation that is valid for both Biichi 



and Co-Biichi automata. There is, however, a sim- 
ple and apparent improvement of the algorithm when 
used for the minimisation of Biichi and Co-Biichi au- 
tomata: Instead of almost equivalence of states, we 
can use language equivalence for Biichi or Co-Biichi 
automata, respectively. But the algorithm provides 
for more: It isolates the minimisation problem within 
in the SCC. That is, both precise and approximative 
minimisation techniques can look into these simpler 
sub-structures. 

While being language preserving when the DBA or 
DCA is read as a DFA is a sufficient criterion for 
language preservation of the automaton itself, it is 
by no means necessary. In this context it becomes 
apparent that the NP-completeness result of the pre- 
vious section may not hint at the fact that state-space 
reduction for DBAs and DCAs is beyond price; one 
should rather take it as a hint that a high price might 
have to be paid for the additional benefit one can get 
from stronger state-space reductions than those for 
DFAs. 

However, even if we consider DFAs, there is at 
time a desire for stronger reductions than language 
preserving minimisation. For this reason, hyper- 
minimisation, the problem of finding a minimal au- 
tomaton with a finite symmetrical difference in its 
language, has been studied for DFAs |BGS09irBad091 
IGJ091 IHMlOj . In this section, we introduce rela- 
tive minimisation where we seek a minimal automa- 
ton for which the symmetrical difference intersected 
with the initial sequences of every infinite word is 
bounded. The underlying notion of approximate 
equivalence is weaker than the /-equivalence used 
for hyper-minimisation, and in my opinion it is also 
more natural even for DFAs. (One is often not re- 
ally interested in differences on words that one never 
observes.) It surely is the better starting point for 
minimising DBAs and DCAs. We develop a simple 
algorithm for relative minimisation, and discuss how 
it can be strengthened to approximate minimal DBAs 
or DCAs even better. 

Almost Equivalence. For two (not necessarily 
different) DFAs A^ = {Y.,Qi,ql,6i,Fi) and A^ = 
(S, Q2, q^, §2, F2), we call two states qi € Qi and 52 € 
Q2 almost equivalent if, for all w-words a € S"^, it 
holds that for the runs rQrlr2rl . . . and r'^rfr'^r'^ . . . 
of Ag^ and A'^^ on a, membership of the states in the 
final states is equivalent almost everywhere (3n G 
uj. Mi > n. rj G Fi <^ rf £ F2). Two DFAs are called 



almost equivalent if their initial states are, and we 
extend these definitions to DBAs and DCAs. 

Obviously, almost equivalence is a congruence and 
hence defines quotient classes on the states of au- 
tomata. It is also easy to compute: 

Lemma 4.1 Testing almost equivalence (or inequiv- 
alence) of two DFAs A and B is NL-complete, and 
the quotient class of a DFA A can be computed in 
time quadratic in the size of the automaton. 

Proof: It is simple to construct in deterministic 
logspace an automaton A j B whose states are or- 
dered pairs of A and B states, with the pair of initial 
states of A and B as initial state, whose final states 
are the pairs of a final and a non-final state (where the 
final state might be an ^ or a i3 state). Two states 
qa and qb are obviously almost equivalent if, and only 
if, the language of {Aj B)(^q^^q^) is empty when read 
as a DBA, which is in NL by Corollarv 12.21 For com- 
pleteness, it is again easy to reduce the reachability 
problem of directed graphs to refuting almost equiv- 
alence of two automata. 

This simple construction also caters for a quadratic 
deterministic algorithm for finding the quotients of 
almost equivalent states: We can construct Aj A in 
quadratic time and find the SCCs in ^ j ^ in time 
linear in ^ j ^. Two states p, q are obviously either 
almost equivalent or one can reach a final state in a 
non-trivial SCC from (p, q) in ^ j^, and these states 
can be computed in time linear in ^ j ^ by a simple 
fixed-point algorithm. D 



Finding minimal almost equivalent automata 
is tractable. We call the problem of finding a min- 
imal automaton almost equivalent to a DFA A rela- 
tive minimisation. Besides the usefulness of relative 
minimisation for DFAs themselves, let us consider 
the usefulness of relative minimisation for the state- 
space reduction of deterministic Biichi and Co-Biichi 
automata. 

Lemma 4.2 Two deterministic Biichi and Co-Biichi 
automata that are, when read as deterministic finite 
automata, almost language equivalent recognise the 
same language. 

Proof: The priority of the states in their runs differs 
in only finitely many positions. D 



We can therefore use the inexpensive DFA minimi- 
sation, hyper-minimisation (which in particular re- 
sults in an almost equivalent automaton), and the 
newly introduced relative minimisation of DFAs for 
a state-space reduction of DBAs and DCAs. This 
provides the back-bone for efficient relative minimisa- 
tion: To find, for a given DFA A = (E, Q' , q'^, 6",F'), 
a minimal deterministic automaton T> that accepts an 
almost equivalent language, we execute the following 
algorithm: 

Construction 4.3 In a first stepo, we construct 
the minimal language equivalent automaton B = 
{T,,Q,qo,5,F) in quasi-linear time using Hopcroft's 
algorithm \Hop70^ . 

For B, we then introduce a pre-order (Q+,^) on 
the states of B such that (1) two states are equivalent 
if, and only if, they are in the same SCC of B, (2) if 
p is reachable from q then p ^ q, and (3) T and _L 
are bigger than all states in Q. (This can obviously 
be done in linear time.) 

In a third step, we determine the quotient classes 
of almost equivalent states of B, and pick, for each 
quotient class [q], a representative rj^] e [q] that is 
maximal with respect to >z among the states almost 
equivalent to q. 

We then construct an automaton C = 
(S, (5,r[qg],(^', F) by choosing the representative 
rugi of the quotient [qo] of states almost equivalent to 
the initial state as new initial state, and changing all 
transitions that lead to states whose representative 
is bigger (with respect to ^) to the representatives 
of these states. That is, for 5{q,a) = q' , we get 



S'{q,cr) =q' if q 



r\nn 



and 6'{q, a) 



rrq/i otherwise. 



Finally, we minimise C using Hopcroft's algorithm 
again, yielding a DFA T>. 

Lemma 4.4 The DFAs A and T> of the above con- 
struction are almost equivalent. 

Proof: First, A and B are language equivalent. 

To compare the language of B and C, we note that, 
if p and q are almost equivalent, then so are 8{p, a) 
and 6{q,a) for all a in S. (Assuming the opposite, 
there would be a word a € S'^ for which priority of 
the runs of Bs(p,a) and Bs(q^a) differ on infinitely many 
positions, which implied the same for a ■ a and runs 
on Bp and Bq and hence lead to a contradiction.) 



^This step is not necessary for the correctness of the algo- 
rithm or for its complexity. 



Let us now consider runs rQrir2r^ ■ ■ ■ and 
r^r^ri^r^ ... of B and C on some w-word a. Then r^ 
and r? are almost equivalent for alii £ co by the above 
observation. Also, states in a run of C can never go 
down in the pre-order (Q+, ^). In particular, there is 
a bounded (at most \Q\) number of positions in the 
run, where C takes an adjusted transition — a transi- 
tion 6'{q,a) 7^ S{q,a) — as this involves going strictly 
up in {Q+, ^). The number of positions i € a; where 
either only r^ or only r? are final can thus be esti- 
mated by the number of changed transitions taken 
times the bounded number of differences that can 
occur between almost equivalent states in B. 
Finally, C and T> are again language equivalent. D 
An key observation for the proof that P is mini- 
mal is that almost equivalent states are in the same 
quotient class. 

Lemma 4.5 Two states ofD that are almost equiv- 
alent are in the same SCC. 

Proof: As T> is but the smallest automaton lan- 
guage equivalent to C, we can obtain T) by first con- 
structing a language equivalent DFA C' from C by 
deleting the unreachable states of C and then joining 
the language equivalent states. 

In C, all states that have an almost equivalent peer 
in a bigger (by ^) SCC are unreachable: In the con- 
struction of C, their incoming transitions have been 
re-routed to the representative of their class, and the 
initial state has been swapped to the representative. 
In C, all almost equivalent states are therefore in the 
same SCC. The same holds true for language equiv- 
alent states, as language equivalence is the finer re- 
lation. If two states are connected in C, the same 
holds for their quotients of language equivalent states 
in P. D 

The proof that T) is minimal builds on the fact that, 
whenever we go up in (Q+,^), we choose the same 
representative. 

Theorem 4.6 There is no DFA 8 almost equivalent 
to T> that is strictly smaller than T>. 

Proof: For convenience, we now look at quotient 
classes of almost equivalent states that cover both D 
and £ in this proof. 

First, as T> is minimal (among the language equiva- 
lent automata), all states in T> are reachable. Let us 
assume that there is a smaller DFA £ almost equiv- 
alent to T>. Then £ must (at least) have the same 



quotient classes as T>, and hence, there must be a 
particular quotient class [q] of T> (and £), such that 
there are strictly less representatives of this class in 
£ than in V. 

By the previous lemma, the representatives of quo- 
tient classes of almost equivalent states of T> are all 
in the same SCC. For trivial SCCs, this implies that 
there is only one representative in D and hence at 
least as many in £. 

For non-trivial SCCs, there is a witness of language 
non-equivalence that does not leave the SCC for all 
different occurrences. (Note that Construction 14.31 
guarantees for C that, once an SCC is left, the target 
state — and hence the remainder of the run — is the 
same, no matter from which representative of a quo- 
tient class we start. And the proof of the previous 
lemma showed that the minimisation of C is SCC 
preserving.) 

As £ has less representatives, we can pick one rep- 
resentative r G [q] of this class in D such that, for 
all representatives e G [q] in i5, we construct a finite 
word «£ G 5]* that is accepted either only by £e or 
only by T>r, such that the run of T>r on ag stays in 
the SCC containing r. This invites a simple pumping 
argument: We can construct a word starting with a 
sequence /3o that leads to r in P. It also leads to 
some state ei almost equivalent to r in £. Next, we 
continue our word with ag^, witnessing a difference. 
From the resulting state in T>, we continue with a 
non-empty sequence /3i G 11+ that brings us back 
to r. (We stay in the same SCC by construction.) 
Meanwhile, we have reached some state 62 almost 
equivalent to r in £. Next, we continue our word 
with Oea, witnessing a difference, and continue with 
a non-empty sequence /32 G S"*" that brings us back 
to r in V, and so forth. We thus create an infinite se- 
quence /3oaei/3i 062/32063 • • • with infinitely many dif- 
ferences, which contradicts the almost equivalence of 
V and £. O 

Corollary 4.7 We can construct a minimal almost 
equivalent automaton to a given DFA A in time 
quadratic in the size of A. D 

Note that the quadratic cost occurs only for con- 
structing the quotients of almost equivalent states. 
Hence, there is a clear critical path, and improve- 
ment on this path would lead to an improvement of 
the overall algorithm. 

It is interesting to observe that the minimal au- 
tomaton almost equivalent to a weak automaton 



(when read as a DFA) obtained by Construction H3] 
is weak, and a language equivalent weak automata is 
almost equivalent. (An automaton is called weak if 
it recognises the same language when read as a DBA 
or as a DCA, or, similarly, if all states in the same 
sec have the same priority.) 

Theorem 4.8 The algorithm from Construction \4 ■ 3| 
can he used to minimise weak automata. 

Proof: It suffices to show that two language equiva- 
lent (when read as DBAs or DCAs) weak automaton 
A and B are almost equivalent and that every almost 
equivalent automaton to A is weak. 

As A and B are weak, the runs of an arbitrary input 
word a € S'^ will eventually always reside in the 
same SCC. As A and B are language equivalent, all 
states in this SCC are either accepting (both for the 
respective SCC of A and of B) or rejecting. Hence, 
the finality of the states in the run may only differ 
on a finite prefix. 

Let C be an automaton almost equivalent to a weak 
automaton A. Assuming that C is not weak, it has 
a (reachable) SCC that contains accepting and re- 
jecting states. It is simple to exploit this for con- 
structing an input word a and a run p of C on a 
such that p contains infinitely many accepting and 
infinitely many rejecting states. As A is weak, a run 
p' of .4 on a will eventually always reside in the same 
SCC of A, which contains only accepting or only re- 
jecting states. Hence, only finitely many states in p' 
are accepting or only finitely many states in p' are re- 
jecting, which contradicts the assumption of almost 
equivalence to p. D 

Almost equivalence can hence be read as a general- 
isation of language equivalence of weak automata. 

Space Reduction for DBAs and DCAs. 

The techniques introduced for finding minimal al- 
most equivalent automata can easily be adjusted to 
stronger state-space reductions for DBAs and DCAs: 
If we use language equivalence for the respective au- 
tomata instead of almost equivalence, the resulting 
automaton remains language equivalent. 

Theorem 4.9 Swapping quotients of almost equiva- 
lent states for the coarser quotients of language equiv- 
alent states for DBAs and DCAs in Construction \4-^ 
provides a language equivalent automaton D, and the 
cost remains quadratic in the size of A. 



Proof: First, A and B as well as C and V are 
language equivalent as finite automata, and hence as 
DBAs and DCAs (cf. Lemma [32]). 

To compare the language of B and C, we argue along 
the same line as in Lemma 14. 4t If two states p and q 
are language equivalent, so are 6{p, a) and 6{q, a) for 
all a in S, which implies that, for runs rQrirl^r^ . . . 
and rQr^r2r^ ... of B and C on some w-word a, r\ 
and r? are language equivalent for all i G lo. The 
claim then follows again from the fact that, for every 
run of C, the states cannot go down in the pre-order 
((5+,^), and go up every time 5'{rl,ai) / K^t^'^i) 
holds. 

The complexity remains quadratic: To determine 
for a DBA the ordered pairs of states for which 
C{Bp) \ C{Bq) is non-empty, we can simply construct 
a DPA V with states and transitions like BjB, and a 
priority function that maps pairs (a, 6) to priority 3 if 
b is accepting, to 2 if a is accepting while b is not, and 
to 1 otherwise. It now suffices to construct V, delete 
the states with priority 3, determine the non-trivial 
SCCs, find states in the non-trivial SCCs with prior- 
ity 2, and then consider from which states of V they 
are reachable. (This is but the deterministic version 
of the construction from Theorem 12. II ) Two states p 
and q are obviously language equivalent if, and only 
if, C{Bp) \ C{Bq) and C{Bp) \ C{Bq) are empty. 

A similar construction can be devised for DCAs. D 

An interesting corollary from the proofs of Theo- 
rems 14.91 and 14.61 is: 



Corollary 4.10 Minimisation techniques for DBAs 
or DCAs can treat the individual SCCs of the result- 
ing automaton T> individually. D 

An interesting aspect of this minimisation is that 
we can treat a local version of weak automata: We 
call an SCC weak if all infinite paths within this SCC 
are accepting or all infinite paths within this SCC are 
rejecting. For weak SCCs, we can obviously make 
all states accepting or rejecting, respectively, without 
changing the language of a DBA or DCA. 

Doing so in the automaton C in from Construc- 
tion 14.31 leads to all states equivalent by the respec- 
tive equivalence relation (almost equivalence or lan- 
guage equivalence as DBA or DCA) becoming lan- 
guage equivalent when the automaton is read as a 
DFA, and are therefore merged in V. Thus, there is 
exactly one of these states in D, and the V is locally 
optimal. 
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A further tractable minimisation would be to greed- 
ily merge states: For an automaton A we denote with 
^pi>9 f^liQ automaton that results from changing the 
transition function 6 to 5' such that 5'{r,a) = q if 
6{r,a) = p and 6'{r,a) = S{r,a) otherwise, choosing 
q as initial state if q was the former initial state, and 
removing p from the state-space. A natural tractable 
minimisation would be to greedily consider A^'^'' for 
language equivalent states p and q until no further 
states can be merged. Note that, by Corollary 14.1^ 
it suffices to look at the respective SCCs only, which 
may speed up the computation significantly. 

This is even more important for exhaustive search 
for minimal automata, such as the SAT based meth- 
ods suggested by Ehlers [EhllOj . 



5 Discussion 

This report has two main results: First, it establishes 
that minimising deterministic Biichi, Co-Biichi and 
parity automata are NP-complete problems. 

A second central contribution is the introduction of 
relative minimisation of DFAs, a powerful technique 
to minimise deterministic finite automata when al- 
lowing for minor differences in their language. This 
natural minimisation problem on DFAs is strictly be- 
tween the problem of hyper-minimising DFAs and 
minimising DBAs or DCAs and can be viewed as a 
generalisation of the minimisation problem of weak 
automata. We show that the relative minimisation 
of DFAs is tractable and provide a simple quadratic 
algorithm. 

Finally, we strengthened this algorithm by relaxing 
the requirement for merging states from almost to 
language equivalent states, which provides a promis- 
ing technique to reduce the state-space of DBAs and 
DCAs. This technique does not only have the poten- 
tial to reduce the state-space of the automaton sig- 
nificantly, it also suffices to focus on its SCCs when 
seeking to reduce the state-space of the automaton 
further. This can be used to accelerate further re- 
duction heuristics — like the greedy merge discussed — 
and exhaustive search methods alike. 
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